Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP
نویسنده
چکیده
Many important traits in plant breeding are polygenic and therefore recalcitrant to traditional marker-assisted selection. Genomic selection addresses this complexity by including all markers in the prediction model. A key method for the genomic prediction of breeding values is ridge regression (RR), which is equivalent to best linear unbiased prediction (BLUP) when the genetic covariance between lines is proportional to their similarity in genotype space. This additive model can be broadened to include epistatic effects by using other kernels, such as the Gaussian, which represent inner products in a complex feature space. To facilitate the use of RR and nonadditive kernels in plant breeding, a new software package for R called rrBLUP has been developed. At its core is a fast maximum-likelihood algorithm for mixed models with a single variance component besides the residual error, which allows for effi cient prediction with unreplicated training data. Use of the rrBLUP software is demonstrated through several examples, including the identifi cation of optimal crosses based on superior progeny value. In cross-validation tests, the prediction accuracy with nonadditive kernels was signifi cantly higher than RR for wheat (Triticum aestivum L.) grain yield but equivalent for several maize (Zea mays L.) traits. THE ABILITY TO PREDICT COMPLEX TRAITS from marker data is becoming increasingly important in plant breeding (Bernardo, 2008). Th e earliest attempts, now over 20 years old, involved fi rst identifying signifi cant markers and then combining them in a multiple regression model (Lande and Th ompson, 1990). Th e focus over the last decade has been on genomic selection methods, in which all markers are included in the prediction model (Bernardo and Yu, 2007; Heff ner et al., 2009; Jannink et al., 2010). One of the fi rst methods proposed for genomic selection was ridge regression (RR), which is equivalent to best linear unbiased prediction (BLUP) in the context of mixed models (Whittaker et al., 2000; Meuwissen et al., 2001). Th e basic RR-BLUP model is
منابع مشابه
A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
One objective of this study was to provide readers with a clear and unified understanding of parametric statistical and kernel methods, used for genomic prediction, and to compare some of these in the context of rice breeding for quantitative traits. Furthermore, another objective was to provide a simple and user-friendly R package, named KRMM, which allows users to perform RKHS regression with...
متن کاملGenomic Regions Introgressed in a Set of Bean NILs
genome-wide prediction approaches of the performance of threeway hybrids. Using an empirical barley data set, Li et al. showed that applying the developed genomic selection model allowing for subpopulation-specific marker effects (GSA-RRBLUP) tripled the prediction ability of three-way hybrids from 0.095 to 0.308 compared with the ridge regression best linear unbiased prediction approach that m...
متن کاملTitle: Accuracy of Genomic Selection Methods in a Standard Dataset of Loblolly Pine
Genomic selection can increase genetic gain per generation through early selection. Genomic selection is expected to be particularly valuable for traits that are costly to phenotype, and expressed late in the life-cycle of long-lived species. Alternative approaches to genomic selection prediction models may perform differently for traits with distinct genetic properties. Here the performance of...
متن کاملGenome-Wide Prediction of the Performance of Three-Way Hybrids in Barley.
Predicting the grain yield performance of three-way hybrids is challenging. Three-way crosses are relevant for hybrid breeding in barley ( L.) and maize ( L.) adapted to East Africa. The main goal of our study was to implement and evaluate genome-wide prediction approaches of the performance of three-way hybrids using data of single-cross hybrids for a scenario in which parental lines of the th...
متن کاملGenomic selection for QTL-MAS data using a trait-specific relationship matrix
BACKGROUND The genomic estimated breeding values (GEBV) of the young individuals in the XIV QTL-MAS workshop dataset were predicted by three methods: best linear unbiased prediction with a trait-specific marker-derived relationship matrix (TABLUP), ridge regression best linear unbiased prediction (RRBLUP), and BayesB. METHODS The TABLUP method is identical to the conventional BLUP except that...
متن کامل